The Music Listening Histories Dataset
نویسندگان
چکیده
We introduce the Music Listening Histories Dataset (MLHD), a large-scale collection of music listening events assembled from more than 27 billion time-stamped logs extracted from Last.fm. The logs are organized in the form of listening histories per user, and have been conveniently preprocessed and cleaned. Attractive features of the MLHD are the self-declared metadata provided by users at the moment of registration whose identities have been anonymized, MusicBrainz identifiers for the music entities in each of the logs that allows for an easy linkage to other existing resources, and a set of user profiling features designed to describe aspects of their music listening behavior and activity. We describe the process of assembling the dataset, its content, its demographic characteristics, and discuss about the possible uses of this collection, which, currently, is the largest research dataset of this kind in the field.
منابع مشابه
The Million Musical Tweet Dataset - What We Can Learn From Microblogs
Microblogs and Social Media applications are continuously growing in spread and importance. Users of Twitter, the currently most popular platform for microblogging, create more than a billion posts (called tweets) every week. Among all the different types of information being shared, some people post their music listening behavior, which is why Twitter became interesting for the Music Informati...
متن کاملAutomatic Music Recommendation Systems: Do Demographic, Profiling, and Contextual Features Improve Their Performance?
Traditional automatic music recommendation systems’ performance typically rely on the accuracy of statistical models learned from past preferences of users on music items. However, additional sources of data such as demographic attributes of listeners, their listening behaviour, and their listening contexts encode information about listeners, and their listening habits, that may be used to impr...
متن کاملCombining Spotify and Twitter Data for Generating a Recent and Public Dataset for Music Recommendation
In this paper, we present a dataset based on publicly available information. It contains listening histories of Spotify users, who posted what they are listening at the moment on the micro blogging platform Twitter. The dataset was derived using the Twitter Streaming API and is updated regularly. To show an application of this dataset, we implement and evaluate a pure collaborative filtering ba...
متن کاملPersonality Traits and Music Genre Preferences: How Music Taste Varies Over Age Groups
Personality traits are increasingly being incorporated in systems to provide a personalized experience to the user. Current work focusing on identifying the relationship between personality and behavior, preferences, and needs oen do not take into account dierences between age groups. With music playing an important role in our lives, dierences between age groups may be especially prevalent....
متن کاملGaining Musical Insights: Visualizing Multiple Listening Histories
Listening histories are rich sources of implicit information. Combining listening histories of multiple users allows comparisons in musical taste and discovery of new music, though few existing work specifically addresses this issue. In this paper we present two interactive visualizations which give users a deeper insight into consent and dissent in their listening behaviors, and help them to c...
متن کامل